Overview

Dataset statistics

Number of variables20
Number of observations5644
Missing cells96719
Missing cells (%)85.7%
Duplicate rows5002
Duplicate rows (%)88.6%
Total size in memory977.7 KiB
Average record size in memory177.4 B

Variable types

NUM17
BOOL1
CAT1
UNSUPPORTED1

Reproduction

Analysis started2020-04-12 15:42:18.110027
Analysis finished2020-04-12 15:44:07.275854
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 5002 (88.6%) duplicate rows Duplicates
Hematocrit has 5041 (89.3%) missing values Missing
Platelets has 5042 (89.3%) missing values Missing
Leukocytes has 5042 (89.3%) missing values Missing
Serum Glucose has 5436 (96.3%) missing values Missing
Urea has 5247 (93.0%) missing values Missing
Proteina C reativa mg/dL has 5138 (91.0%) missing values Missing
Creatinine has 5220 (92.5%) missing values Missing
Alanine transaminase has 5419 (96.0%) missing values Missing
Aspartate transaminase has 5418 (96.0%) missing values Missing
Total Bilirubin has 5462 (96.8%) missing values Missing
Urine - Leukocytes has 5574 (98.8%) missing values Missing
International normalized ratio (INR) has 5511 (97.6%) missing values Missing
Lactic Dehydrogenase has 5543 (98.2%) missing values Missing
D-Dimer has 5644 (100.0%) missing values Missing
pH (arterial blood gas analysis) has 5617 (99.5%) missing values Missing
HCO3 (arterial blood gas analysis) has 5617 (99.5%) missing values Missing
pO2 (arterial blood gas analysis) has 5617 (99.5%) missing values Missing
Neutrophils/Lymphocytes ratio has 5131 (90.9%) missing values Missing
D-Dimer is an unsupported type, check if it needs cleaning or further analysis Rejected
Patient age quantile has 334 (5.9%) zeros Zeros

Variables

Patient age quantile
Real number (ℝ≥0)

ZEROS
Distinct count20
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.318391212
Minimum0
Maximum19
Zeros334
Zeros (%)5.9%
Memory size44.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14
median9
Q314
95-th percentile18
Maximum19
Range19
Interquartile range (IQR)10

Descriptive statistics

Standard deviation5.777903287
Coefficient of variation (CV)0.6200537363
Kurtosis-1.213257198
Mean9.318391212
Median Absolute Deviation (MAD)5.022219114
Skewness0.03462259201
Sum52593
Variance33.3841664
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 3.5 4.5 7.5 ... 10.5 11.5 12.5 18.5 19. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
11 380 6.7%
 
4 366 6.5%
 
9 359 6.4%
 
0 334 5.9%
 
7 319 5.7%
 
2 315 5.6%
 
13 313 5.5%
 
14 299 5.3%
 
5 294 5.2%
 
6 281 5.0%
 
Other values (10) 2384 42.2%
 
ValueCountFrequency (%) 
0 334 5.9%
 
1 234 4.1%
 
2 315 5.6%
 
3 251 4.4%
 
4 366 6.5%
 
ValueCountFrequency (%) 
19 275 4.9%
 
18 259 4.6%
 
17 263 4.7%
 
16 279 4.9%
 
15 269 4.8%
 

Hematocrit
Real number (ℝ)

MISSING
Distinct count176
Unique (%)29.2%
Missing5041
Missing (%)89.3%
Infinite0
Infinite (%)0.0%
Mean-2.186214104e-09
Minimum-4.501419544
Maximum2.662703753
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-4.501419544
5-th percentile-1.752501726
Q1-0.5188073516
median0.05340702832
Q30.7171751261
95-th percentile1.403832316
Maximum2.662703753
Range7.164123297
Interquartile range (IQR)1.235982478

Descriptive statistics

Standard deviation1.000830214
Coefficient of variation (CV)-457791490.6
Kurtosis1.447843764
Mean-2.186214104e-09
Median Absolute Deviation (MAD)0.7750989634
Skewness-0.7318291006
Sum-1.318287104e-06
Variance1.001661116
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.473030895 10 0.2%
 
0.6942868829 9 0.2%
 
0.5569549203 9 0.2%
 
0.1907381266 9 0.2%
 
-0.1068131626 9 0.2%
 
-0.4959191084 9 0.2%
 
-0.2441451252 9 0.2%
 
0.488289386 8 0.1%
 
0.2365154475 8 0.1%
 
-0.1297022551 8 0.1%
 
Other values (166) 515 9.1%
 
(Missing) 5041 89.3%
 
ValueCountFrequency (%) 
-4.501419544 1 < 0.1%
 
-4.066536427 1 < 0.1%
 
-3.608765364 1 < 0.1%
 
-3.540099382 1 < 0.1%
 
-3.334102154 1 < 0.1%
 
ValueCountFrequency (%) 
2.662703753 1 < 0.1%
 
2.433818102 1 < 0.1%
 
2.410929918 1 < 0.1%
 
2.136266708 1 < 0.1%
 
2.090489388 1 < 0.1%
 

Platelets
Real number (ℝ)

MISSING
Distinct count249
Unique (%)41.4%
Missing5042
Missing (%)89.3%
Infinite0
Infinite (%)0.0%
Mean-3.535003563e-10
Minimum-2.5524261
Maximum9.53203392
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-2.5524261
5-th percentile-1.371616006
Q1-0.6053456664
median-0.121716015
Q30.5314980745
95-th percentile1.642590189
Maximum9.53203392
Range12.08446002
Interquartile range (IQR)1.136843741

Descriptive statistics

Standard deviation1.000831594
Coefficient of variation (CV)-2831203917
Kurtosis13.49290414
Mean-3.535003563e-10
Median Absolute Deviation (MAD)0.7332537061
Skewness1.795130187
Sum-2.128072119e-07
Variance1.001663879
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.4546039701 9 0.2%
 
-0.2787386179 8 0.1%
 
-0.1782441586 7 0.1%
 
-0.002378814388 7 0.1%
 
-0.39179492 7 0.1%
 
-0.1656823456 6 0.1%
 
-0.04006424174 6 0.1%
 
0.286542803 6 0.1%
 
-0.1279969215 6 0.1%
 
-0.5174130201 6 0.1%
 
Other values (239) 534 9.5%
 
(Missing) 5042 89.3%
 
ValueCountFrequency (%) 
-2.5524261 1 < 0.1%
 
-2.313751698 1 < 0.1%
 
-2.276066303 1 < 0.1%
 
-2.075077295 2 < 0.1%
 
-2.062515497 1 < 0.1%
 
ValueCountFrequency (%) 
9.53203392 1 < 0.1%
 
3.376748085 1 < 0.1%
 
3.276253462 1 < 0.1%
 
3.03757906 1 < 0.1%
 
2.999893665 1 < 0.1%
 

Leukocytes
Real number (ℝ)

MISSING
Distinct count475
Unique (%)78.9%
Missing5042
Missing (%)89.3%
Infinite0
Infinite (%)0.0%
Mean6.215832763e-09
Minimum-2.020302534
Maximum4.522041798
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-2.020302534
5-th percentile-1.235415918
Q1-0.637254715
median-0.2128790095
Q30.4542954564
95-th percentile1.958394533
Maximum4.522041798
Range6.542344332
Interquartile range (IQR)1.091550171

Descriptive statistics

Standard deviation1.000831604
Coefficient of variation (CV)161013277.2
Kurtosis2.950854935
Mean6.215832763e-09
Median Absolute Deviation (MAD)0.735927396
Skewness1.4290192
Sum3.741931316e-06
Variance1.0016639
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.4201970398 6 0.1%
 
-0.07513076067 5 0.1%
 
-0.3589755595 3 0.1%
 
-0.9628413916 3 0.1%
 
-0.08069633693 3 0.1%
 
0.3506364822 3 0.1%
 
-0.2615778446 3 0.1%
 
-0.7485664487 3 0.1%
 
-0.6929104924 3 0.1%
 
-0.8849231601 3 0.1%
 
Other values (465) 567 10.0%
 
(Missing) 5042 89.3%
 
ValueCountFrequency (%) 
-2.020302534 1 < 0.1%
 
-1.928470135 1 < 0.1%
 
-1.828289747 1 < 0.1%
 
-1.789330602 1 < 0.1%
 
-1.733674765 1 < 0.1%
 
ValueCountFrequency (%) 
4.522041798 1 < 0.1%
 
4.455255032 1 < 0.1%
 
4.224282742 1 < 0.1%
 
3.779036045 1 < 0.1%
 
3.609285593 1 < 0.1%
 

Serum Glucose
Real number (ℝ)

MISSING
Distinct count71
Unique (%)34.1%
Missing5436
Missing (%)96.3%
Infinite0
Infinite (%)0.0%
Mean7.069992047e-09
Minimum-1.109751225
Maximum7.006487846
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-1.109751225
5-th percentile-0.815991801
Q1-0.5040617585
median-0.2920704484
Q30.1394832917
95-th percentile1.549225539
Maximum7.006487846
Range8.116239071
Interquartile range (IQR)0.6435450502

Descriptive statistics

Standard deviation1.002412561
Coefficient of variation (CV)141784114.4
Kurtosis20.43113959
Mean7.069992047e-09
Median Absolute Deviation (MAD)0.5928936155
Skewness3.830783892
Sum1.470558345e-06
Variance1.004830942
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.3526394069 16 0.3%
 
-0.4132083654 11 0.2%
 
-0.2920704484 7 0.1%
 
-0.1406480819 7 0.1%
 
-0.04979466274 7 0.1%
 
-0.6554841399 6 0.1%
 
-0.1103636101 6 0.1%
 
-0.4434928298 6 0.1%
 
-0.5040617585 6 0.1%
 
-0.3829238713 6 0.1%
 
Other values (61) 130 2.3%
 
(Missing) 5436 96.3%
 
ValueCountFrequency (%) 
-1.109751225 1 < 0.1%
 
-1.049182296 3 0.1%
 
-0.9280443788 2 < 0.1%
 
-0.86747545 2 < 0.1%
 
-0.8371909261 3 0.1%
 
ValueCountFrequency (%) 
7.006487846 1 < 0.1%
 
6.491652012 1 < 0.1%
 
3.705480099 1 < 0.1%
 
3.220928431 1 < 0.1%
 
3.06950593 1 < 0.1%
 

Urea
Real number (ℝ)

MISSING
Distinct count54
Unique (%)13.6%
Missing5247
Missing (%)93.0%
Infinite0
Infinite (%)0.0%
Mean-6.675260423e-09
Minimum-1.630410194
Maximum11.24656868
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-1.630410194
5-th percentile-1.034943104
Q1-0.588342607
median-0.1417421997
Q30.4537250102
95-th percentile1.43624599
Maximum11.24656868
Range12.87697887
Interquartile range (IQR)1.042067617

Descriptive statistics

Standard deviation1.001261839
Coefficient of variation (CV)-149995921.5
Kurtosis41.14203352
Mean-6.675260423e-09
Median Absolute Deviation (MAD)0.6470849736
Skewness4.370358124
Sum-2.650078388e-06
Variance1.002525269
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.588342607 24 0.4%
 
-0.4394758046 22 0.4%
 
-0.3650423884 21 0.4%
 
0.4537250102 18 0.3%
 
-0.7372094393 16 0.3%
 
-0.06730879098 16 0.3%
 
0.08155801147 16 0.3%
 
-0.662776053 15 0.3%
 
-0.5139092207 15 0.3%
 
-0.2161756009 15 0.3%
 
Other values (44) 219 3.9%
 
(Missing) 5247 93.0%
 
ValueCountFrequency (%) 
-1.630410194 1 < 0.1%
 
-1.555976748 1 < 0.1%
 
-1.481543422 2 < 0.1%
 
-1.407109976 2 < 0.1%
 
-1.332676649 3 0.1%
 
ValueCountFrequency (%) 
11.24656868 1 < 0.1%
 
4.473128796 1 < 0.1%
 
4.175395012 1 < 0.1%
 
3.282194376 1 < 0.1%
 
2.910027266 1 < 0.1%
 

Proteina C reativa mg/dL
Real number (ℝ)

MISSING
Distinct count265
Unique (%)52.4%
Missing5138
Missing (%)91.0%
Infinite0
Infinite (%)0.0%
Mean2.779703396e-09
Minimum-0.5353622437
Maximum8.02667141
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-0.5353622437
5-th percentile-0.5333752036
Q1-0.5135051012
median-0.3942843676
Q30.03242637357
95-th percentile1.918597341
Maximum8.02667141
Range8.562033653
Interquartile range (IQR)0.5459314748

Descriptive statistics

Standard deviation1.000989606
Coefficient of variation (CV)360106624.2
Kurtosis17.47033008
Mean2.779703396e-09
Median Absolute Deviation (MAD)0.6165984953
Skewness3.653520946
Sum1.406529918e-06
Variance1.001980191
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.5214531422 59 1.0%
 
-0.5353622437 17 0.3%
 
-0.5333752036 12 0.2%
 
-0.4618427753 9 0.2%
 
-0.5115180612 8 0.1%
 
-0.5194661617 8 0.1%
 
-0.5234401226 7 0.1%
 
-0.5015829802 6 0.1%
 
-0.4777388871 6 0.1%
 
-0.5313881636 6 0.1%
 
Other values (255) 368 6.5%
 
(Missing) 5138 91.0%
 
ValueCountFrequency (%) 
-0.5353622437 17 0.3%
 
-0.5333752036 12 0.2%
 
-0.5313881636 6 0.1%
 
-0.5294011831 4 0.1%
 
-0.5274142027 6 0.1%
 
ValueCountFrequency (%) 
8.02667141 1 < 0.1%
 
5.946270466 1 < 0.1%
 
5.733659744 1 < 0.1%
 
5.499192715 1 < 0.1%
 
5.032244682 1 < 0.1%
 

Creatinine
Real number (ℝ)

MISSING
Distinct count119
Unique (%)28.1%
Missing5220
Missing (%)92.5%
Infinite0
Infinite (%)0.0%
Mean-6.679603657e-09
Minimum-2.389998674
Maximum5.053571701
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-2.389998674
5-th percentile-1.454383254
Q1-0.6324890256
median-0.08111327887
Q30.5133383721
95-th percentile1.666058201
Maximum5.053571701
Range7.443570375
Interquartile range (IQR)1.145827398

Descriptive statistics

Standard deviation1.001181335
Coefficient of variation (CV)-149886338.6
Kurtosis3.008933511
Mean-6.679603657e-09
Median Absolute Deviation (MAD)0.7429014725
Skewness0.9000916683
Sum-2.832151948e-06
Variance1.002364066
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.6324890256 16 0.3%
 
-0.5635669231 13 0.2%
 
0.0567304939 12 0.2%
 
0.2634963691 11 0.2%
 
-0.2878791392 10 0.2%
 
0.09119164199 10 0.2%
 
0.4013403356 9 0.2%
 
-0.6669499278 9 0.2%
 
-0.3223400712 8 0.1%
 
0.3668794036 8 0.1%
 
Other values (109) 318 5.6%
 
(Missing) 5220 92.5%
 
ValueCountFrequency (%) 
-2.389998674 1 < 0.1%
 
-2.252154827 2 < 0.1%
 
-2.217694044 1 < 0.1%
 
-2.183232784 2 < 0.1%
 
-2.148771763 1 < 0.1%
 
ValueCountFrequency (%) 
5.053571701 1 < 0.1%
 
4.605579376 1 < 0.1%
 
3.812976837 1 < 0.1%
 
3.296062469 1 < 0.1%
 
3.158218384 1 < 0.1%
 

Alanine transaminase
Real number (ℝ)

MISSING
Distinct count62
Unique (%)27.6%
Missing5419
Missing (%)96.0%
Infinite0
Infinite (%)0.0%
Mean2.719461918e-09
Minimum-0.6419506073
Maximum7.930662632
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-0.6419506073
5-th percentile-0.5592565536
Q1-0.448997885
median-0.2836098373
Q30.1022955626
95-th percentile1.193856621
Maximum7.930662632
Range8.572613239
Interquartile range (IQR)0.5512934476

Descriptive statistics

Standard deviation1.002229667
Coefficient of variation (CV)368539695.4
Kurtosis30.29209005
Mean2.719461918e-09
Median Absolute Deviation (MAD)0.5222613763
Skewness4.96916835
Sum6.118789315e-07
Variance1.004464305
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.448997885 15 0.3%
 
-0.5041272044 15 0.3%
 
-0.3938685358 14 0.2%
 
-0.3663038611 13 0.2%
 
-0.4765625596 10 0.2%
 
-0.5592565536 9 0.2%
 
-0.5316919088 8 0.1%
 
-0.1182218194 8 0.1%
 
-0.2836098373 7 0.1%
 
0.1298602372 6 0.1%
 
Other values (52) 120 2.1%
 
(Missing) 5419 96.0%
 
ValueCountFrequency (%) 
-0.6419506073 3 0.1%
 
-0.6143859029 4 0.1%
 
-0.5868212581 2 < 0.1%
 
-0.5592565536 9 0.2%
 
-0.5316919088 8 0.1%
 
ValueCountFrequency (%) 
7.930662632 1 < 0.1%
 
6.442170143 1 < 0.1%
 
5.394712448 1 < 0.1%
 
5.063936234 1 < 0.1%
 
2.886327505 1 < 0.1%
 

Aspartate transaminase
Real number (ℝ)

MISSING
Distinct count51
Unique (%)22.6%
Missing5418
Missing (%)96.0%
Infinite0
Infinite (%)0.0%
Mean-5.439583168e-10
Minimum-0.7041224837
Maximum7.231171608
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-0.7041224837
5-th percentile-0.5783190578
Q1-0.4331612289
median-0.2783262134
Q30.03134381399
95-th percentile1.531307966
Maximum7.231171608
Range7.935294092
Interquartile range (IQR)0.4645050429

Descriptive statistics

Standard deviation1.002219764
Coefficient of variation (CV)-1842456919
Kurtosis28.74427773
Mean-5.439583168e-10
Median Absolute Deviation (MAD)0.5126628527
Skewness4.872636219
Sum-1.229345798e-07
Variance1.004444456
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.3944524527 17 0.3%
 
-0.4718699753 14 0.2%
 
-0.3557437062 14 0.2%
 
-0.510578692 14 0.2%
 
-0.4331612289 13 0.2%
 
-0.239617452 13 0.2%
 
-0.2783262134 12 0.2%
 
-0.1621999592 12 0.2%
 
-0.549287498 10 0.2%
 
0.1474700719 8 0.1%
 
Other values (41) 99 1.8%
 
(Missing) 5418 96.0%
 
ValueCountFrequency (%) 
-0.7041224837 1 < 0.1%
 
-0.6654137373 3 0.1%
 
-0.6267049909 3 0.1%
 
-0.5879962444 5 0.1%
 
-0.549287498 10 0.2%
 
ValueCountFrequency (%) 
7.231171608 1 < 0.1%
 
6.998919487 1 < 0.1%
 
6.22474432 1 < 0.1%
 
3.515131712 1 < 0.1%
 
3.050626516 1 < 0.1%
 

Total Bilirubin
Real number (ℝ)

MISSING
Distinct count19
Unique (%)10.4%
Missing5462
Missing (%)96.8%
Infinite0
Infinite (%)0.0%
Mean-2.78373341e-09
Minimum-1.093173742
Maximum5.028598785
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-1.093173742
5-th percentile-0.7870850563
Q1-0.7870850563
median-0.1749077886
Q30.1311808228
95-th percentile1.661623836
Maximum5.028598785
Range6.121772528
Interquartile range (IQR)0.9182658792

Descriptive statistics

Standard deviation1.002758615
Coefficient of variation (CV)-360220778
Kurtosis7.250373717
Mean-2.78373341e-09
Median Absolute Deviation (MAD)0.6929039194
Skewness2.353336261
Sum-5.066394806e-07
Variance1.00552484
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.7870850563 52 0.9%
 
-0.4809963703 29 0.5%
 
-0.1749077886 28 0.5%
 
0.1311808228 25 0.4%
 
0.4372695386 13 0.2%
 
1.049446702 9 0.2%
 
0.7433580756 8 0.1%
 
1.355535269 4 0.1%
 
-1.093173742 3 0.1%
 
1.661623836 2 < 0.1%
 
Other values (9) 9 0.2%
 
(Missing) 5462 96.8%
 
ValueCountFrequency (%) 
-1.093173742 3 0.1%
 
-0.7870850563 52 0.9%
 
-0.4809963703 29 0.5%
 
-0.1749077886 28 0.5%
 
0.1311808228 25 0.4%
 
ValueCountFrequency (%) 
5.028598785 1 < 0.1%
 
4.722510338 1 < 0.1%
 
3.804244518 1 < 0.1%
 
3.498155832 1 < 0.1%
 
3.192067146 1 < 0.1%
 

Urine - Leukocytes
Categorical

MISSING
Distinct count31
Unique (%)44.3%
Missing5574
Missing (%)98.8%
Memory size44.2 KiB
<1000
9
3000
9
4000
7
2000
7
1000
 
4
Other values (26)
34
ValueCountFrequency (%) 
<1000 9 0.2%
 
3000 9 0.2%
 
4000 7 0.1%
 
2000 7 0.1%
 
1000 4 0.1%
 
8000 3 0.1%
 
38000 2 < 0.1%
 
7000 2 < 0.1%
 
10000 2 < 0.1%
 
5000 2 < 0.1%
 
Other values (21) 23 0.4%
 
(Missing) 5574 98.8%
 

Length

Max length7
Mean length3.019135365
Min length3
ValueCountFrequency (%) 
Decimal_Number 10 76.9%
 
Lowercase_Letter 2 15.4%
 
Math_Symbol 1 7.7%
 
ValueCountFrequency (%) 
Common 11 84.6%
 
Latin 2 15.4%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

International normalized ratio (INR)
Real number (ℝ)

MISSING
Distinct count42
Unique (%)31.6%
Missing5511
Missing (%)97.6%
Infinite0
Infinite (%)0.0%
Mean-4.733639556e-09
Minimum-1.797149301
Maximum7.369844437
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-1.797149301
5-th percentile-0.7446426988
Q1-0.6654217839
median-0.1561441869
Q30.2965464294
95-th percentile1.360370731
Maximum7.369844437
Range9.166993737
Interquartile range (IQR)0.9619682133

Descriptive statistics

Standard deviation1.003780728
Coefficient of variation (CV)-212052632.4
Kurtosis21.97370208
Mean-4.733639556e-09
Median Absolute Deviation (MAD)0.6483969101
Skewness3.545190363
Sum-6.295740613e-07
Variance1.007575751
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.6654217839 38 0.7%
 
-0.09955786169 10 0.2%
 
0.2399601042 6 0.1%
 
-0.2693168521 5 0.1%
 
-0.5522491336 4 0.1%
 
-1.23128581 4 0.1%
 
0.692651391 4 0.1%
 
0.0136147989 4 0.1%
 
0.5228924155 4 0.1%
 
0.6360650659 3 0.1%
 
Other values (32) 51 0.9%
 
(Missing) 5511 97.6%
 
ValueCountFrequency (%) 
-1.797149301 1 < 0.1%
 
-1.23128581 4 0.1%
 
-0.7785944939 2 < 0.1%
 
-0.7220081687 1 < 0.1%
 
-0.6654217839 38 0.7%
 
ValueCountFrequency (%) 
7.369844437 1 < 0.1%
 
3.069278955 1 < 0.1%
 
3.01269269 1 < 0.1%
 
2.220483541 1 < 0.1%
 
1.711205959 1 < 0.1%
 

Lactic Dehydrogenase
Real number (ℝ)

MISSING
Distinct count79
Unique (%)78.2%
Missing5543
Missing (%)98.2%
Infinite0
Infinite (%)0.0%
Mean1.73355094e-09
Minimum-1.358584046
Maximum2.950034618
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-1.358584046
5-th percentile-1.134588599
Q1-0.6997738481
median-0.3308401704
Q30.4729083478
95-th percentile2.343929291
Maximum2.950034618
Range4.308618665
Interquartile range (IQR)1.172682196

Descriptive statistics

Standard deviation1.00498755
Coefficient of variation (CV)579727729.4
Kurtosis0.7193500415
Mean1.73355094e-09
Median Absolute Deviation (MAD)0.7970021409
Skewness1.131790663
Sum1.75088644e-07
Variance1.009999976
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-1.04235518 4 0.1%
 
-1.134588599 3 0.1%
 
0.4202035069 3 0.1%
 
-0.304487735 2 < 0.1%
 
-0.1200208664 2 < 0.1%
 
0.3411462903 2 < 0.1%
 
-0.9896503687 2 < 0.1%
 
-0.528483212 2 < 0.1%
 
-0.3703687787 2 < 0.1%
 
-0.9501217604 2 < 0.1%
 
Other values (69) 77 1.4%
 
(Missing) 5543 98.2%
 
ValueCountFrequency (%) 
-1.358584046 1 < 0.1%
 
-1.266350627 1 < 0.1%
 
-1.147764802 1 < 0.1%
 
-1.134588599 3 0.1%
 
-1.04235518 4 0.1%
 
ValueCountFrequency (%) 
2.950034618 1 < 0.1%
 
2.739215374 1 < 0.1%
 
2.567924738 1 < 0.1%
 
2.462515116 1 < 0.1%
 
2.43616271 1 < 0.1%
 

D-Dimer
Unsupported

MISSING
REJECTED
UNSUPPORTED
Missing5644
Missing (%)100.0%
Memory size44.2 KiB

pH (arterial blood gas analysis)
Real number (ℝ)

MISSING
Distinct count24
Unique (%)88.9%
Missing5617
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean4.139211145e-10
Minimum-3.568877459
Maximum1.042673826
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-3.568877459
5-th percentile-2.2300789
Q1-0.09210582823
median0.2942021191
Q30.5115003437
95-th percentile0.8422765195
Maximum1.042673826
Range4.611551285
Interquartile range (IQR)0.6036061719

Descriptive statistics

Standard deviation1.019049334
Coefficient of variation (CV)2461940931
Kurtosis7.205072951
Mean4.139211145e-10
Median Absolute Deviation (MAD)0.6090046876
Skewness-2.608062119
Sum1.117587045e-08
Variance1.038461545
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.09210582823 2 < 0.1%
 
0.3062742651 2 < 0.1%
 
0.1010481417 2 < 0.1%
 
1.042673826 1 < 0.1%
 
-0.2248991877 1 < 0.1%
 
0.6563658118 1 < 0.1%
 
0.02861540392 1 < 0.1%
 
0.2942021191 1 < 0.1%
 
-0.14039433 1 < 0.1%
 
0.704654336 1 < 0.1%
 
Other values (14) 14 0.2%
 
(Missing) 5617 99.5%
 
ValueCountFrequency (%) 
-3.568877459 1 < 0.1%
 
-2.892838478 1 < 0.1%
 
-0.683639884 1 < 0.1%
 
-0.5267022848 1 < 0.1%
 
-0.2248991877 1 < 0.1%
 
ValueCountFrequency (%) 
1.042673826 1 < 0.1%
 
0.8495197892 1 < 0.1%
 
0.8253755569 1 < 0.1%
 
0.704654336 1 < 0.1%
 
0.6563658118 1 < 0.1%
 

HCO3 (arterial blood gas analysis)
Real number (ℝ)

MISSING
Distinct count23
Unique (%)85.2%
Missing5617
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean6.070843449e-09
Minimum-2.985592127
Maximum2.029471397
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-2.985592127
5-th percentile-1.374194425
Q1-0.5397210717
median0.05633191019
Q30.5085099936
95-th percentile1.429308486
Maximum2.029471397
Range5.015063524
Interquartile range (IQR)1.048231065

Descriptive statistics

Standard deviation1.01904932
Coefficient of variation (CV)167859594.5
Kurtosis1.849896771
Mean6.070843449e-09
Median Absolute Deviation (MAD)0.7355291886
Skewness-0.6287143435
Sum1.639127735e-07
Variance1.038461516
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.09743914753 2 < 0.1%
 
0.05633191019 2 < 0.1%
 
-0.06698902696 2 < 0.1%
 
-0.3547389209 2 < 0.1%
 
1.289544463 1 < 0.1%
 
0.4262955189 1 < 0.1%
 
-0.7658097148 1 < 0.1%
 
1.453972578 1 < 0.1%
 
-0.6424887776 1 < 0.1%
 
0.2207600772 1 < 0.1%
 
Other values (13) 13 0.2%
 
(Missing) 5617 99.5%
 
ValueCountFrequency (%) 
-2.985592127 1 < 0.1%
 
-1.546844125 1 < 0.1%
 
-0.9713451266 1 < 0.1%
 
-0.889130652 1 < 0.1%
 
-0.8480241895 1 < 0.1%
 
ValueCountFrequency (%) 
2.029471397 1 < 0.1%
 
1.453972578 1 < 0.1%
 
1.371758938 1 < 0.1%
 
1.289544463 1 < 0.1%
 
0.7962591052 1 < 0.1%
 

pO2 (arterial blood gas analysis)
Real number (ℝ)

MISSING
Distinct count27
Unique (%)100.0%
Missing5617
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean-2.469729492e-08
Minimum-1.175907493
Maximum2.205371141
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-1.175907493
5-th percentile-1.125698185
Q1-0.8169897795
median-0.1599549055
Q30.4500090033
95-th percentile2.075925398
Maximum2.205371141
Range3.381278634
Interquartile range (IQR)1.266998783

Descriptive statistics

Standard deviation1.019049329
Coefficient of variation (CV)-41261576.72
Kurtosis0.02201809513
Mean-2.469729492e-08
Median Absolute Deviation (MAD)0.8052139068
Skewness0.9734468701
Sum-6.668269626e-07
Variance1.038461536
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.1481869966 1 < 0.1%
 
-1.054306865 1 < 0.1%
 
-0.1168060973 1 < 0.1%
 
-0.3364717662 1 < 0.1%
 
-0.4188462496 1 < 0.1%
 
-0.7130410671 1 < 0.1%
 
-0.7993380427 1 < 0.1%
 
-0.8934804201 1 < 0.1%
 
-0.9562419057 1 < 0.1%
 
0.6363323331 1 < 0.1%
 
Other values (17) 17 0.3%
 
(Missing) 5617 99.5%
 
ValueCountFrequency (%) 
-1.175907493 1 < 0.1%
 
-1.156294465 1 < 0.1%
 
-1.054306865 1 < 0.1%
 
-0.9601644874 1 < 0.1%
 
-0.9562419057 1 < 0.1%
 
ValueCountFrequency (%) 
2.205371141 1 < 0.1%
 
2.087693214 1 < 0.1%
 
2.048467159 1 < 0.1%
 
1.538529634 1 < 0.1%
 
0.9501401186 1 < 0.1%
 

Neutrophils/Lymphocytes ratio
Real number (ℝ)

MISSING
Distinct count509
Unique (%)99.2%
Missing5131
Missing (%)90.9%
Infinite0
Infinite (%)0.0%
Mean-0.4817346234
Minimum-151.3453611
Maximum162.82088
Zeros0
Zeros (%)0.0%
Memory size44.2 KiB

Quantile statistics

Minimum-151.3453611
5-th percentile-2.816516921
Q1-1.312218447
median-0.9656280027
Q3-0.6256831556
95-th percentile2.004800259
Maximum162.82088
Range314.1662411
Interquartile range (IQR)0.6865352915

Descriptive statistics

Standard deviation11.6981124
Coefficient of variation (CV)-24.2833125
Kurtosis138.7381219
Mean-0.4817346234
Median Absolute Deviation (MAD)2.301141854
Skewness2.748436893
Sum-247.1298618
Variance136.8458338
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-1.030141363 2 < 0.1%
 
-1.07877585 2 < 0.1%
 
-0.7515946647 2 < 0.1%
 
-0.9944592019 2 < 0.1%
 
-0.3425254882 1 < 0.1%
 
-10.95139785 1 < 0.1%
 
0.6522563213 1 < 0.1%
 
-1.856958323 1 < 0.1%
 
-0.2144534992 1 < 0.1%
 
-1.395476691 1 < 0.1%
 
Other values (499) 499 8.8%
 
(Missing) 5131 90.9%
 
ValueCountFrequency (%) 
-151.3453611 1 < 0.1%
 
-24.99199965 1 < 0.1%
 
-22.15547679 1 < 0.1%
 
-17.78653527 1 < 0.1%
 
-12.11131425 1 < 0.1%
 
ValueCountFrequency (%) 
162.82088 1 < 0.1%
 
95.0380799 1 < 0.1%
 
68.86470916 1 < 0.1%
 
50.28572294 1 < 0.1%
 
22.20188253 1 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
False
5488
True
 
156
ValueCountFrequency (%) 
False 5488 97.2%
 
True 156 2.8%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

Patient age quantileHematocritPlateletsLeukocytesSerum GlucoseUreaProteina C reativa mg/dLCreatinineAlanine transaminaseAspartate transaminaseTotal BilirubinUrine - LeukocytesInternational normalized ratio (INR)Lactic DehydrogenaseD-DimerpH (arterial blood gas analysis)HCO3 (arterial blood gas analysis)pO2 (arterial blood gas analysis)Neutrophils/Lymphocytes ratioGaso performed
013NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
1170.236515-0.517413-0.09461-0.1406481.198059-0.1478952.089928NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN-1.944575False
28NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
35NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
415NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
59NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
613NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
716NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
81-1.5716821.4296670.36455-0.413208-0.067309-0.286986-1.838623-0.586821-0.1622NaNNaN0.2965460.907723NaNNaNNaNNaN22.201883True
917NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse

Last rows

Patient age quantileHematocritPlateletsLeukocytesSerum GlucoseUreaProteina C reativa mg/dLCreatinineAlanine transaminaseAspartate transaminaseTotal BilirubinUrine - LeukocytesInternational normalized ratio (INR)Lactic DehydrogenaseD-DimerpH (arterial blood gas analysis)HCO3 (arterial blood gas analysis)pO2 (arterial blood gas analysis)Neutrophils/Lymphocytes ratioGaso performed
563415NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<1000NaNNaNNaNNaNNaNNaNNaNFalse
563512NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
56366NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
563712NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
563814NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
56393NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
564017NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
56414NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNFalse
564210NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN29000NaNNaNNaNNaNNaNNaNNaNFalse
5643190.694287-0.906829-1.288428NaN0.453725-0.50357-0.735872-0.283610.108761-0.480996NaNNaN0.420204NaNNaNNaNNaN-1.287291False